# Multimodal Unified Modeling
4M 21 L
Other
4M is an 'any-to-any' foundational model training framework extended to multiple modalities through tokenization and masking techniques
Multimodal Fusion
4
EPFL-VILAB
49
3
Anygpt Base
Apache-2.0
AnyGPT is a multimodal language model that supports arbitrary modal conversion, uniformly processing diverse modalities such as speech, text, images, and music through discrete representations.
Text-to-Image
Transformers English

A
fnlp
452
10
Featured Recommended AI Models